Learning non-concatenative morphology
Abstract
Recent work in computational psycholinguistics shows that morpheme lexica can be acquired without supervision from a corpus of words by selecting the lexicon that best balances productivity and reuse (e.g., Goldwater et al., 2009). In this paper, we extend such work to the problem of acquiring non-concatenative morphology. We propose a simple model that handles both concatenative and non-concatenative morphology, and we apply Bayesian inference to two datasets, one of Arabic verbs and one of English verbs, to acquire lexica. We show that our approach successfully extracts the non-contiguous triliteral root from Arabic verb stems.
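To make "non-contiguous triliteral root" concrete, here is a minimal sketch of root-and-template interleaving. It is an illustration only, not the Bayesian model the abstract describes: the function names `interleave` and `extract_root`, the slot symbol `C`, and the romanized template `CaCaC` are assumptions introduced for this example.

```python
def interleave(root: str, template: str, slot: str = "C") -> str:
    """Fill each slot symbol in the template with the next root consonant."""
    if template.count(slot) != len(root):
        raise ValueError("root length must match the number of slots")
    consonants = iter(root)
    return "".join(next(consonants) if ch == slot else ch for ch in template)


def extract_root(stem: str, template: str, slot: str = "C"):
    """Return the root if the stem fits the template, otherwise None."""
    if len(stem) != len(template):
        return None
    root = []
    for s, t in zip(stem, template):
        if t == slot:
            root.append(s)      # slot position: this segment belongs to the root
        elif s != t:
            return None         # fixed template segment does not match the stem
    return "".join(root)


# Romanized Arabic: the root k-t-b is spread across the stem "katab".
print(interleave("ktb", "CaCaC"))
print(extract_root("katab", "CaCaC"))
```

In this sketch the template is given in advance. The task described in the abstract is harder: both the roots and the material they interleave with have to be inferred from the word list alone.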
Similar resources
Evaluating Sequence Alignment for Learning Inflectional Morphology
This work examines CRF-based sequence alignment models for learning natural language morphology. Although these systems have performed well for a limited number of languages, this work, as part of the SIGMORPHON 2016 shared task, specifically sets out to determine whether these models handle non-concatenative morphology as well as previous work might suggest. Results, however, indicate a strong...
Non-concatenative Derivation: Other Processes
This chapter provides an overview of a wide range of non-concatenative (nonreduplicative) phenomena in morphology focusing on a typological categorization. The definition of non-concatenative morphology is not uncontroversial. Kurisu (2001: 2) considers non-concatenative morphology to be observed in cases where the phonological instantiation of a morpheme cannot be demarcated in an output repr...
A Constraint-based Morphological Analyzer for Concatenative and Non-concatenative Morphology
Current methods of morphological analysis, such as finite-state and unification-based approaches, are predominantly effective for handling concatenative morphology (e.g. prefixation and suffixation), although some of these techniques can also handle limited non-concatenative phenomena (e.g. infixation and partial and full-stem reduplication). A constraint-based method to perform morphological analysis...